Solving zero-sum one-sided partially observable stochastic games
نویسندگان
چکیده
Many security and other real-world situations are dynamic in nature can be modelled as strictly competitive (or zero-sum) games. In these domains, agents perform actions to affect the environment receive observations -- possibly imperfect about situation effects of opponent's actions. Moreover, there is no limitation on total number an agent that is, fixed horizon. These settings partially observable stochastic games (POSGs). However, solving general POSGs computationally intractable, so we focus a broad subclass called one-sided POSGs. games, only one has information while their opponent full knowledge current situation. We provide picture for POSGs: (1) give theoretical analysis value functions, (2) show variant value-iteration algorithm converges this setting, (3) adapt heuristic search POSGs, (4) describe how use approximate functions derive strategies game, (5) demonstrate our solve non-trivial sizes analyze scalability three different domains: pursuit-evasion, patrolling,
منابع مشابه
Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games
We study two player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. The players have partial observability, however, the evader is given an advantage of knowing the current position of the units of the pursuer. We show that (1) value functions of this game depend only on the position of the pursuing units and the belief the pursuer has about the position o...
متن کاملDefinable Zero-Sum Stochastic Games
Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...
متن کاملDynamic Programming for Partially Observable Stochastic Games
We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form r...
متن کاملPlanning for Weakly-Coupled Partially Observable Stochastic Games
Partially observable stochastic games (POSGs) provide a powerful framework for modeling multi-agent interactions. While elegant and expressive, the framework has been shown to be computationally intractable [Bernstein et al., 2002]. An exact dynamic programming algorithm for POSGs has been developed recently, but due to high computational demands, it has only been demonstrated to work on extrem...
متن کاملDynamic Programming Approximations for Partially Observable Stochastic Games
Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes with a price, namely a high computational cost. Solving POSGs optimally quickly becomes intractable after a few decision cycles. Our main contribution is to provide bounded approximation techniques, which enable us to sca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial Intelligence
سال: 2023
ISSN: ['2633-1403']
DOI: https://doi.org/10.1016/j.artint.2022.103838